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Abstract 

Tracking  control  of  a  class  of  nonlinear,  uncertain,  multi-input,  multiple-output  systems  is 
addressed  in  this  paper.  The  control  system  architecture  uses  neural  networks  for  function  ap¬ 
proximation,  certainty  equivalent  control  inputs  to  cancel  plant  dynamics  and  smoothed  sliding 
mode  control  to  insure  that  the  trajectories  remain  bounded.  Lyapunov  analysis  is  used  to 
derive  equations  for  the  sliding  mode  control,  neural  network  training,  and  to  show  uniform 
ultimate  boundedness  of  the  closed  loop  system.  Stability  analysis  results  are  shown  for  single¬ 
input  single-output  and  two-input  two-output  systems.  Results  are  then  extended  to  the  more 
general  multiple-input  multiple-output  case  where  the  number  of  inputs  is  equal  to  the  number 
of  outputs.  Simple  simulation  examples  are  used  to  illustrate  control  system  performance. 


1  Introduction 

The  goal  of  this  work  is  to  apply  intelligent  control  methods  to  high  performance  manned  air¬ 
craft  and  Unmanned  Air  Vehicles  (UAVs)  [9,  21].  These  systems  are  characterized  by  multiple 
input/output  variables,  large  nonlinearities,  significant  uncertainties,  configuration  variations,  and 
planform  changes  through  decades  of  operation.  Through  all  this,  the  control  system  is  required  to 
maintain  the  highest  levels  of  performance,  guarantee  stability  and  robustness,  and  to  meet  very 
stringent  reliability  requirements.  To  meet  these  stability  and  performance  requirements,  control 
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designers  typically  require  very  accurate  models.  Therefore,  a  large  amount  of  time  and  effort  is 
needed  to  reduce  the  plant’s  dynamic  model  uncertainties  to  tolerable  levels.  Some  research  work 
has  focused  on  reducing  modeling  uncertainty  and  associated  costs  by  developing  control  system 
design  methods  that  use  available  model  information  generated  off-line  to  define  the  nominal  con¬ 
trol  system,  and  then  use  gathered  data  and  known  relationships  between  parameters  to  refine  the 
control  system  on-line.  In  this  way,  some  stability  and  control  performance  requirements  can  be  met 
while  reducing  the  overall  system  design  costs  [3]. 

Flight  control  system  designs  have  historically  relied  on  linear  time  invariant  (LTI)  models  of  the 
form 


x  =  Ax  +  Bu 
y  =  Cx 

where  x ,  y,  and  u  are  the  state  vector,  output  vector  and  control  vector  respectively.  The  A,  or 
stability  matrix  is  based  upon  stability  derivatives  and  B ,  or  control  matrix  is  based  upon  the  control 
derivatives.  The  C  is  the  output  matrix  based  on  how  states  are  combined  to  form  outputs. 

The  A  and  B  matrices  also  depend  on  nominal  air  speed  and  air  density.  For  this  reason  they  can 
vary  greatly  from  one  flight  condition  to  another,  so  designs  are  typically  carried  out  at  a  number 
of  points  in  the  flight  envelope  and  then  blended  together  [18].  The  performance  of  these  control 
systems,  which  are  referred  to  as  Gain  Scheduled  Controllers ,  is  dependent  on  the  accuracy  of  a  small 
perturbation  model.  Historically,  gain  scheduled  control  has  worked  relatively  well  because  aircraft 
dynamics  are  predominantly  a  function  of  states  such  as  Mach,  altitude,  and  angle  of  attack.  These 
matrices  are  very  seldom  represented  as  a  function  of  the  controls  themselves  because  the  controls 
respond  faster  than  states  and  control  effectiveness  (represented  by  B)  is  predominantly  independent 
of  the  control  input,  u. 

In  this  work,  we  directly  address  the  plant  nonlinearities  by  representing  the  plant  using  a  form 
which  includes  nonlinear  functions  of  the  state  and  which  is  affine  with  respect  to  the  control.  We  also 
assume  that  we.have  full-state  measurement,  removing  the  need  for  nonlinear  observers,  which  makes 
the  problem  considerably  less  complicated.  The  resulting  generic  model  form  is  well  known  and  given 
by  x  —  f(x)+g(x)u.  A  variety  of  approaches  to  control  system  design  have  been  proposed  to  directly 
address  system  nonlinearities.  These  include  feedback  linearization  [12],  dynamic  inversion  [27], 
sliding  mode  control  [14],  and  backstepping  [16].  Feedback  linearization  transforms  the  plant  into  a 
linear  (or  partially  linear)  system  by  a  suitable  nonlinear  state  transformation  and  cancellation  of  the 
nonlinearities  by  feedback  control.  Backstepping  is  similar  to  feedback  linearization,  but  does  not 
rely  on  model  inversion  and  does  not  require  the  designer  to  cancel  helpful  nonlinearities.  Further, 
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backstepping  is  able  to  deal  with  plants  where  certain  matching  conditions  are  not  satisfied. 

Both  feedback  linearization  and  backstepping  rely  on  cancellation  of  known  nonlinearities.  To 
address  the  issue  of  uncertainty,  several  techniques  have  been  developed:  (i)  adaptive  methods  deal 
with  parametric  uncertainty  [13,  26,  33],  where  the  nonlinearities  are  assumed  to  be  known  but  some 
of  the  parameters  that  multiply  these  nonlinearities  are  unknown  or  uncertain;  (ii)  robust  methods 
deal  with  the  case  where  known  upper  bounds  on  the  unknown  nonlinearities  are  available  [1,  5] 
and  therefore,  they  tend  to  be  conservative,  sometimes  leading  to  high-gain  feedback;  (iii)  robust 
adaptive  methods  combine  parametric  uncertainty  and  unknown  nonlinearities  with  partially  known 
bounds  [24,  35]. 

The  above  control  techniques  are  based  on  the  assumption  that  the  plant  nonlinearities  are  either 
known  or  can  be  bounded  by  some  known  functions.  In  many  applications,  including  control  of  high 
performance  aircraft  systems,  some  of  the  nonlinearities  need  to  be  approximated  on-line.  This  may 
be  due  to  modeling  errors  during  the  identification/modeling  phase  or,  quite  often,  due  to  time- 
variations  in  the  dynamics  as  a  result  of  changes  in  the  operating  conditions  or  due  to  component 
wear  or  damage.  To  address  the  issue  of  unknown  nonlinearities,  various  control  system  architectures 
have  incorporated  neural  networks  as  on-line  approximators  of  unknown  nonlinearities  [34].  These 
control  systems  are  often  labeled  connectionist ,  intelligent  or  neural.  Early  work  in  neural  control 
attempted  to  define  the  role  of  the  neural  networks  within  the  control  system  [20]  and  much  of  this 
work  dealt  largely  with  the  role  of  the  network,  the  distinction  between  adaptive  and  neural  control, 
and  learning  paradigms  [8],  While  research  stressing  these  topics  justifiably  continues,  some  have 
been  driven  by  application  requirements  to  stress  closed-loop  system  stability.  Such  is  the  case  with 
flight  control  system  design. 

This  stability  focused  work  on  neural  control  uses  networks  as  a  component  in  a  mathematical 
framework  from  which  adaptive  control  laws  can  be  derived  and  stability  guarantees  can  be  made. 
Because  networks  are  typically  nonlinear,  they  are  often  used  in  combination  with  nonlinear  control 
methods  and  Lyapunov  analysis  is  commonly  the  stability  analysis  tool  employed.  Typically,  the 
feedback  control  law  and  the  adaptive  law  for  updating  the  network  weights  are  derived  by  utilizing 
a  Lyapunov  function,  whose  time  derivative  is  forced  to  have  some  desirable  stability  properties  (for 
example,  negative  definiteness).  Therefore,  the  stability  of  the  closed-loop  system  is  obtained  during 
the  synthesis  of  the  adaptive  control  laws.  Examples  of  this  type  of  approach,  which  is  referred  to 
as  Lyapunov  synthesis  method,  include  [2,  4,  6,  7,  15,  19,  22,  23,  25,  28,  29,  32,  30,  36]. 

Most  of  the  results  in  neural  control  based  on  the  Lyapunov  synthesis  method  are  derived  for 
systems  with  a  single  control  input.  The  multivariable  problem,  especially  if  the  control  inputs 
are  multiplied  by  unknown  nonlinearities,  becomes  more  challenging  due  to  the  coupling  between 
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control  inputs.  In  this  paper  we  present  a  control  design  approach  which  can  be  applied  to  a  class 
of  nonlinear,  affine,  Multiple-Input  Multiple-Output  (MIMO)  plants  with  full-state  measurement, 
where  the  number  of  inputs  is  the  same  as  the  number  of  outputs.  The  control  objective  is  to  achieve 
tracking  of  some  desired  state  trajectories.  An  adaptive  bounding  technique  is  employed  to  handle 
the  unknown  network  reconstruction  error  approximation  [19,  25].  Sufficient  stabilizability  condi¬ 
tions  on  the  unknown  control  multiplier  functions  are  derived  and  both  neural  control  components 
and  sliding  mode  control  signals  are  combined  in  proportions  determined  by  the  accuracy  of  the 
function  approximation.  The  Lyapunov  synthesis  approach  is  used  to  derive  a  neural  control  system 
with  guaranteed  stability  properties. 

This  paper  is  organized  as  follows:  We  first  formulate  the  Single-Input  Single-Output  (SISO) 
problem  in  Section  2  by  showing  the  plant  model  structures  to  be  considered,  define  some  of  the 
parameters  and  assumptions,  and  define  the  control  structure.  Analytical  results  are  then  used 
to  show  stability  properties  of  the  proposed  scheme.  A  SISO  example  is  provided  in  Section  3  to 
illustrate  the  concepts.  Section  4  extends  the  definitions,  assumptions,  control  architecture  and 
stability  analysis  to  a  class  of  Two-Input  Two-Output  (TITO)  systems  and  an  illustrative  example 
for  this  is  shown  in  Section  5.  The  control  system  design  is  then  extended  to  MIMO  systems  of 
arbitrary  size  in  Section  6  and  conclusions  are  stated  in  Section  7. 

2  Neural  Control  of  a  SISO  System 

We  start  by  considering  the  neural  control  problem  for  a  SISO  system.  This  illustrates  some  of  the 
issues  that  arise  in  dealing  with  unknown  nonlinearities  multiplying  the  control  input  and  provides 
a  convenient  framework  for  the  extension  into  the  MIMO  domain. 

Consider  the  problem  of  tracking  control  for  a  SISO  plant  given  by 

x  =  Wi(a:)  +  w2  (x)u  (1) 

where  the  measurable  state  variable  x(t)  belongs  to  a  domain  of  interest  Q  C  R,  u  €  R  is  the  control 
input  and  w i(x),w2(x)  are  unknown  nonlinearities  which  are -assumed  to  be  locally  Lipschitz  (to 
assure  uniqueness  of  solutions).  The  control  objective  is  to  design  a  feedback  control  law  such  that 
x(t)  follows  some  desired  trajectory  Xd(t)  as  closely  as  possible. 

It  is  often  the  case  that  a  control  designer  has  a  rough  estimate  of  the  characteristics  of  the  plant 
dynamics  either  through  analytical  modeling  or  through  empirical  studies.  The  control  approach 
being  used  here  directly  allows  for  this  type  of  information  to  be  used  by  having  w\  (x),  w2(x)  each  be 
composed  of  a  known  part  and  by  an  unknown  part  [23,  25].  However,  for  notational  simplification, 
in  this  paper  the  functions  in  equation  (1)  are  assumed  to  be  completely  unknown. 
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Since  wi{x)  and  iu2(x)  are  unknown,  they  are  approximated  for  use  in  the  control  system.  In 
this  work  we  use  linearly  parameterized  approximators  of  the  form 

Wi(x)  =  0^.£(x)  fori  =1,2  (2) 

where  Wi(x)  and  u)2(#)  are  estimates  of  the  unknown  functions,  9Wl ,  0W2  G  Mm  are  the  parameter 
vector  estimates  and  t;(x)  :  M  »->  Rm  is  a  vector  of  strictly  positive  basis  functions.  For  example, 
Radial  Basis  Functions  (RFB)  networks  using  Gaussian  functions  can  be  used  in  this  framework. 
As  we  will  see  later  on,  the  assumption  that  each  element  of  $(ar)  is  positive  is  used  to  deal  with 
a  stabilizability  problem.  In  general,  a  different  set  of  basis  functions  £Wl(s) >$11*0*0  could  be  used 
in  approximating  W\{x)  and  W2{x )  respectively.  However,  in  order  to  simplify  the  notation  (and 
without  any  loss  of  generality)  we  let  $(#)  $u;i  (x)  —  $ttfa0*0’ 

For  each  i  —  1,2  the  best  approximation  for  Wi(x)  is  defined  as 

w*(x)  =  0*J^(x)  for  i  =  1,2  (3) 

where  0JJ,  denotes  the  parameter  vector  that  minimizes  the  difference  between  Wi(x)  and  0^(x) 
for  all  x  in  the  domain  of  interest,  H,  i.e., 

ewt  =  arS  „  min  {sup  |toi(*)  -  Ol^(x) |}  (4) 

vWl  GM 

and 

0*W2  =  arg  mm  {sup  | w2(x)  -  C2?(x)|}  (5) 

0W2€QW2  X£Q 

where  ©W2  C  Rm  is  the  set  of  all  0W2  which  lie  within  parameter  bounds  defined  later.  We  use  these 
best  possible  parameters  given  by  equations  (4)  and  (5)  to  define  the  parameter  vector  estimation 
errors  as 

QWi  =  0Wi  -  0*Wi  for  i  =  1,2.  (6) 

It  is- important  to  note  that  unless  the  actual  functions,  tt)j(x),- are  linear  combinations  of  the  f(x) 
basis  functions,  there  will  be  some  residual  errors  remaining  in  each  approximation  even  if  the  best 
6Wi  vector  is  used.  In  this  work,  bounds  on  the  approximation  errors  of  W\{x)  are  addressed  using 
an  adaptive  bounding  method  while  W2(x)  has  fixed  bounds  to  insure  well-defined  controls. 

The  error  which  remains  after  the  best  fit  has  been  achieved  is  referred  to  as  the  reconstruction 
error  and  is  given  by 

SWi(x)  =  Wi(x)  -  W*(x).  (7) 
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The  reconstruction  error  SWi  ( x )  is  an  important  quantity  in  neural  control  design  methods  since 
it  provides  a  measure  of  how  close  an  approximation  can  be  achieved  by  a  given  neural  network 
architecture.  In  general,  as  the  number  of  nodes  in  the  network  increase,  the  reconstruction  error 
becomes  smaller.  In  the  limit,  as  the  number  of  nodes  becomes  infinitely  large  (and  centers  are 
appropriately  placed),  the  universal  approximation  theorem  states  that  the  reconstruction  error 
converges  to  zero  (over  a  compact  domain)  [10].  Unfortunately,  universal  approximation  results  are 
relevant  only  if  the  number  of  nodes  start  becoming  very  large.  Therefore,  nonzero  reconstruction 
error  is  something  that  needs  to  be  dealt  with  in  practical  applications.  In  this  work,  we  define  an 
uPPer  bound  (which  is  assumed  to  be  unknown)  on  the  magnitude  of  SWi  (x)  as  given  by 

Ki  =  sup  \SWi  (x)l  (8) 

In  the  approximation  of  wi(x),  the  necessity  to  assume  a  priori  knowledge  of  a  bound  on  the 
reconstruction  error  is  removed  by  developing  an  adaptive  bounding  scheme  where  the  bound  on 
$wi (#)  is  estimated  on-line.  We  define  this  estimate  of  the  reconstruction  error  bound  as 
and  the  bounding  estimation  error,  $Wl  (t),  which  will  be  used  in  the  stability  analysis  is  defined  as 


$vfi  (0  —  'tpivy  {t)  'fiwi  *  (9) 

where  ^  =  max  }  and  ^  is  a  design  parameter  that  will  appear  in  the  adaptation  law 

for  updating  ipWl  (t). 

Next,  we  define  the  tracking  error,  e{t)  =  x(t)  -  xd(t),  which  is  used  to  obtain  the  sliding  mode 
scalar,  s,  where 


s{t)  =  e(t)  +  c  f  e(r)dr,  (10) 

Jo 

and  where  c  is  a  positive  design  constant.  As  we  will  see  later  on,  s(£)  is  used  to  generate  the 
feedback  control  law,  as  well  as  the  adaptive  laws  for  updating  the  network  weights.  By  defining  the 
sliding  mode  quantity  in  this  way,  we  obtain  a  filtered  error  which  induces  integral  control  action 
[17,  31]. 


The  parameter  vectors,  0Wl  and  0W2 ,  and  the  adaptive  bound,  i/>Wl ,  are  updated  according  to  the 
following  adaptive  laws! 


-  TiAs  -  vKVwi  ~  (11) 

0W2  =  7  us£  (12) 

1>vn  =  7{s  tanh(^)  -  a(ipWl  -  (13) 

where  6^Ui  and  ,  are  design  parameters  (representing  a  priori  estimates  of  the  unknown  network 
parameters  and  adaptive  bound)  and  a  >  0  is  a  leakage  constant  used  in  the  framework  of  the  cr- 
modification  [11]  to  prevent  the  parameter  estimates  from  exhibiting  parameter  drift.  The  constant 
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7  >  0  denotes  the  adaptation  gain,  and  the  e  >  0  is  a  small  design  constant  used  to  smooth  out  the 
sign  function  by  converting  it  into  a  hyperbolic  function  (tanh).  As  shown  in  the  subsequent  analysis, 
these  update  laws  are  derived  based  on  the  Lyapunov  synthesis  approach  to  maintain  stability  and 
prevent  parameter  drift. 

The  network  reconstruction  error  associated  with  the  w2(x)  function  is  treated  differently  than 
the  one  associated  with  the  wi{x).  Typically,  there  is  more  a  priori  information  about  the  range  of 
possible  values  for  w2(x ).  Furthermore,  the  adaptive  bounding  technique  developed  for  Wi(x)  (by 
the  use  of  the  adaptive  parameter  i>Wl(t))  cannot  be  directly  applied  to  the  case  of  w2(x)  due  to  a 
stabilizability  problem.  Therefore,  it  is  assumed  that  upper  and  lower  bounds  on  w2(x)  are  available 
such  that  w2(x)  >  w2(x)  >  w2{x)  >  0  VxGfl.  These  upper  and  lower  bounds  are  chosen  such  that 
there  exist  parameter  vectors  0W2  and  0W2  satisfying  w2(x)  =  9W2£(x)  and  w2(x)  —  9^2£(x).  The 
parameter  vectors  9W2,9W2  are  then  used  to  bound  the  parameter  estimate  vector  9W2.  It  is  worth 
noting  that  this  approach  can  also  be  applied  to  systems  where  w2(x)  <  0  Vx  €  ^  by  changing  the 
input  sign  convention. 

Using  the  previously  defined  quantities,  we  are  now  ready  to  define  the  feedback  control  signal 
as 


C2£  +  ^ w2  sgn {uas) 


(14) 


where  ua  is  defined  as 


ua  =  xd  —  ce  —  k$  —  -  j>Wl  tanh(-) 


(15) 


and  A:  is  a  positive  design  constant.  As  we  will  see  later,  the  adaptive  law  for  0W2  will  be  modified 
to  ensure  that  the  denominator  of  the  control  equation  is  bounded  away  from  zero. 

We  begin  the  stability  analysis  by  defining  the  Lyapunov  function  as 

2 


V(S)9Wi,'tf)Wl)  —  2 5  "^"27  d"  2'y^' 


2 

W\ 


(16) 


where  s  is  given  by  equation  (10),  ^^(t)  is  given  by  equation  (9),  and  9Wi  is  given  by  equation  (6). 
Taking  the  derivative  of  V  with  respect  to  time,  substituting  for  s  and  adding  a  zero  sum  term  we 
get 

2 


V  =  s{u>i  +W2U- Xd  + ce  + (ua  -  ua)}  + -'^OT.Ovi  + 

7  i=l  ^ 


(17) 


7 


Substituting  (15)  and  (7)  into  (17)  we  get 

s  1  2 

V  =  s{(u>!  +  w2u  -  ks-el i>Wi  tanh(^)  -  u0}  +  -{*T 0Z.6Wi  +  } 

C  7  i  =  \ 

=  -ks2  +  s{5Wl  -  0^,  £  +  ( w2u  -ua)~  if>Wl  tanh(-)} 

1  2 

+z(zl^v>i  (18) 

7  i=  1 

Then  by  using  to  bound  SWl  and  using  equation  (9)  we  can  obtain  an  upper  bound  on  V  as 
follows: 


v  <  -ks2  +  \s\rwi  -  S^1  tanh(^)  -  sxj)Wi  tanh(^)  -  +  s{w2u  -  ua)} 

1  ^  ^ 

^Wi^wj  4-  ifiwiipwi}'  (19) 

7  i= 1 

To  bound  the  terms  involving  l ,  we  use  a  property  of  the  hyperbolic  tangent  [23]  according  to 
which  for  any  i/6l  and  any  constant  e  >  0 


0  <  \u\  -  i/tanh(-)  <  K€ 


(20) 


where  k  —  0.2786. 

By  using  equations  (11),  (13)  and  (20)  in  (19),  we  can  write  the  inequality  as 


V  <  ks  +  K€'lpWl  cr^wii^wi  —  )  —  V&wy  (@ivi  ”  ®wi  )  +  s{v)2U  —  Ua)} 

1--  . 

+~0XV26w2-  (21) 

Since  the  denominator  of  the  control  law  described  by  (14)  will  be  designed  such  that  it  remains 
positive  for  all  t  >  0,  it  can  be  readily  concluded  that  sgn(uas)  =  sgn(us).  Now,  by  substituting 
(14)  into  (21),  we  obtain 

V  <  ~ks2+  neij)*Wl  -  (T,tpWl  (ipWl  -  i>°Wi)  -  <jdli  (0Wl  -  e°Wl ) 

+su{w2  -  +  VC2  sgn(us)]}  +  ^elJWi 

—  ~^s  4-  —  cripwi (V* w,  ~  VCi)  ~  a^un  (0u>i  ~  ^,)  +  suSW2  —  |su|^,2 

~su  (22) 

Now  using  equation  (12)  and  knowing  that  tf>*2  >  5m  we  arrive  at  a  bound  on  the  Lyapunov 
derivative  given  by 

V  <  -ks2  +  KerWi  -  cr{i>Wl  (i>Wl  -<,,)  +  K,  -  0°J}.  (23) 
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By  completing  the  squares  it  can  be  shown  that 


v  <  -ks2  +  Kerwi  -  a-{0TwieWl  +  (0W1  -  0°wf{BWl  -  e°wi)  -  (01X  -  e°wf{d*wx  - e°wx)} 

\  {MW1  +  -  rwi)ttvn  -  rwi)  -  «  -  owe  -  cj>-  (24) 

Because  (0Wl  -  0°Vi)T(9Wl  -  0°t)  >  0  and  (ipWl  -  ~  CJ  >  0,  equation  (24)  implies  that 

V  <  - ks 2  +  * erwi  -  \0lxK  +  |{(^  -  01  -  0°W1)} 

+  |{(C1  -  rwinrwi  -  <)}•  (25) 

We  would  like  to  be  able  to  make  stability  conclusions  about  the  system  by  putting  the  system  into 
the  form  of 

V  <  -6V  +  A  (26) 

where  b  €  R+  and  A  is  a  group  of  constant  terms.  To  satisfy  (26)  it  suffices  to  insure  that 
-ks2  +  nerwi  -  y>ZM  +  \  {(C  -  0°Wlf(0k  ~  O}  - 

+  2'{(^t«i  -  _  V’wi)}  -  ~2s2  ~  27^1^1  _  2^4>wl'*i,wi  +  A  (27) 

where  we  define 

6  <  min{2/c,<77}  (28) 

A  =  «e<+|{(C-^1)T(C-^1)  +  (C1-C1)(C1-C1)}-  (29) 

From  (26),  we  can  see  that  V  is  not  negative  definite  (nor  negative  semi-definite).  Therefore  asymp¬ 
totic  convergence  cannot  be  concluded.  However,  for  bounded  A,  the  solution,  V ,  can  be  shown  to 
be  uniformly  ultimately  bounded  (u.u.b). 

The  solution  to  inequality  (26)  satisfies 

pt 

V  <  V0e~bt  +  A  /  e~b{t~T)dr 

Jo 

=  e~bt{V0-±}  +  j.  (30) 

Therefore,  using  equation  (16)  and  (30)  we  obtain  that  for  any  p  >  there  exits  a  T  such  that 
for  all  t  >T,  the  variable  s  in  the  Lyapunov  equation  satisfies 

m\<P.  (31) 

This  implies  that  the  components  of  V  are  also  u.u.b.  The  bound  given  by  (31)  also  implies  that  the 
tracking  error  is  bounded  as  shown  below.  Define  u)(i)  =  f*  e(r)dr .  Using  this  we  can  write  (25)  as 

Cj  =  —cuj  +  s .  (32) 
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Since  equation  (32)  represents  a  linear  system  driven  by  a  bounded  input,  s,  we  know  that  the  state, 
e,  is  bounded  also.  Thus,  we  have  uniformly  ultimately  bounded  tracking  error. 

We  now  address  the  well-definedness  of  the  control  by  restricting  the  adaptation  law  of  the 
parameter  which  multiplies  the  control  signal.  In  order  to  insure  that  the  control  u(t)  remains 
bounded,  the  denominator  in  equation  (14)  must  be  bounded  away  from  zero.  We  insure  this  by 
forcing  $W2  such  that  >  |^J.  Since  we  know  that  the  approximation  of  w2  is  bounded  from 
below  by  w2(x ),  we  require  w2  >  |<Siu2|.  Therefore,  to  insure  bounded  control  we  impose  the  limits 
on  the  approximation,  turning  off  the  update  of  the  parameter  vector  elements  if  the  update  would 
drive  them  below  0W2j  where  j  refers  to  the  jth  element  of  the  parameter  vector.  Since  we  have  also 
assumed  an  upper  bound  on  the  true  functions,  we  turn  off  the  parameter  updates  if  the  update 
would  drive  them  above  0W2j.  Since  this  will  leave  additional  terms  in  the  Lyapunov  derivative,  we 
must  consider  the  stability  impact. 

To  impose  these  bounds  on  the  parameter  elements,  we  now  modify  the  parameter  rule  given  in 
equation  (12).  Each  element  of  the  parameter  vector  is  now  updated  according  to 


Qwij  —  \ 


0  if  {0W2j  <  $W2j  and  7 su^  <  0} 

0  if  {0W2j  >  0w2j  anc*  7 su£j  >  0} 

7  su£j  otherwise. 


(33) 


Elements  of  the  parameter  vector,  0W2j  that  are  updated  using  the  nominal  parameter  update  equa¬ 
tion,  (12),  will  not  leave  residual  terms  in  the  Lyapunov  derivative.  However,  those  elements  of  the 
parameter  vector  that  have  their  update  zeroed,  leave  terms  in  the  Lyapunov  derivative  that  must 
be  considered. 

Let  us  designate  the  elements  in  the  parameter  vector  that  would  violate  their  upper  bound  if 
updated  according  to  (12)  as  0W2i.  Let  us  assume  that  these  parameter  elements  are  q  in  number. 
Similarly,  we  define  those  parameter  elements  that  would  violate  their  lower  bounds  by  using  (12)  as 
0W2j  and  assume  that  these  are  r  in  number.  Thus,  the  terms  that  would  be  added  to  the  Lyapunov 
derivative  due  to  the  zeroing  of  the  parameter  updates  would  have  the  form 

Q  r 

“  ^  ^  @W2i  ~  ^  1  @W2j  .  (34) 

»-l  j= 1 


The  first  term  of  (34)  can  be  written  as 

Q 

-  -  0*wJs<i-  (35) 

i= 1 

Since  we  know  that  the  0W2.  is  at  the  upper  limit  of  the  ith  element  of  the  parameter,  0W2  ,  and  the 
optimal  value,  0*^,  is  below  the  maximum  value;  we  can  conclude  that  (0Wii  -  6^.)  >  0.  Also, 
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since  we  know  that  the  nominal  update  for  0W2 .  is  increasing,  7  >  0,  and  because  £(x)  >  0  Vx,  then 
we  know  that  us  >  0.  Thus,  we  know  that  the  first  term  of  (34)  is  negative,  and  therefore,  does  not 
de- stabilize  the  system. 

The  second  term  of  (34)  can  be  written  as 

(36) 

j= 1 

Here  we  know  that  the  0W2j  is  at  the  lower  limit  of  the  ith  element  of  the  parameter,  6W^ ,  and  the 
optimal  value,  ,  is  above  the  minimum  value;  we  can  conclude  that  ( 0W2j  —&w2j)  —  0*  Also,  since 
we  know  that  the  nominal  update  for  0W 2  is  decreasing,  7  >  0,  and  because  £(x)  >  0  Vx,  then  we 
know  that  us  <  0.  Thus,  we  know  that  the  second  term  of  (34)  is  negative  and  does  not  de-stabilize 
the  system  either.  Therefore,  the  system  tracking  errors  and  parameter  errors  are  guaranteed  to  be 
uniformly  ultimately  bounded  and  the  control  is  bounded  for  all  x  values  in  the  domain,  ft. 

The  problem  we  have  described  is  tracking  control  for  a  SISO,  affinely  represented  model  with 
parametric  uncertainty  due  to  unknown  0^  and  0^2,  bounding  uncertainty  due  to  the  unknown 
reconstruction  error  bounds,  ip^Vl ,  and  unknown  but  bounded  W2-  The  neural  networks  approximate 
the  unknown  system  nonlinearities  ,  the  vector  sliding  mode  control  with  adaptive  bounding 
insures  boundedness,  and  the  integral  action  improves  transient  response. 


3  SISO  Simulation  Example 

To  illustrate  the  neural  control  system  design,  a  SISO  model  has  been  defined  in  the  form  of  equation 
(1)  which  is  affine  in  the  controls  and  has  nonlinear  characteristics.  The  model  doesn’t  represent  a 
physical  system,  but  it  does  possess  an  uncertain  state  dependent  nonlinear  quantity,  uq(x),  and  an 
uncertain  control  multiplier,  W2 ,  where 

Wl(x)  =  2  +  0.5 x  +  0.2500 or2  -0.0104a;3  -  0.0078 x4  (37) 

W2  =  5.  (38) 

We  define  the  uncertainty  used  to  bound  W2  as  A2  where 

A2  =  0.5.  (39) 

Further,  we  define  the  domain  of  the  state,  x,  as  ft  —  [—5, 5]. 

The  desired  trajectory,  x d,  is  given  by  a  combination  of  sine  waves  of  various  frequencies,  square 
waves  and  a  saw  tooth  function.  This  trajectory  provides  abundant  excitation  for  the  function 
approximation  in  some  parts  of  the  state  space. 


11 


The  function  approximation  is  performed  by  a  linear-in-the-parameters  neural  network  of  the 
form  given  in  equation  (2).  The  basis  functions,  £(s),  are  Radial  Basis  Functions  (RBFs)  consisting 
of  51  evenly  spaced  functions  over  a  domain  of  [-5,5]  with  standard  deviation  of  0.15. 

Simulation  results  have  been  generated  for  the  SISO  plant  defined  above,  x and  the  control 
structure  given  in  Section  2.  To  illustrate  impact  of  learning  on  closed  loop  system  performance, 
the  neural  networks  were  initialized  with  weights  that  resulted  in  ^(z)  that  was  about  half  of  the 
true  function  and  W2(x)  «  w2(x).  Closed  loop  tracking  results  are  shown  in  Figure  1  using  three 
separate  five  second  time  slices  of  one  time  history.  These  plots  show  how  the  tracking  performance 


Figure  1:  Tracking  Performance 


improves  as  time  progresses.  The  top  plot  shows  sluggish  performance  during  the  start  of  the 
simulation  where  the  function  approximation  is  relatively  poor.  The  middle  plot  shows  that  the 
performance  is  improved  during  the  interval  between  20  and  25  seconds  due  to  improved  function 
The  bottom  plot  show  the  tracking  between  120  and  125  seconds  where,  at  that  point 
in  the  simulation,  the  function  approximation  is  very  good  and  the  traces  essentially  lie  on  top  of 
each  other.  Thus,  as  time  progresses,  the  control  system  learns  to  improve  its  tracking  performance. 

During  the  125  seconds  of  simulation  the  weights  of  the  neural  network  function  approximators 
adjust.  This  can  be  seen  in  Figure  2,  which  show  the  function  approximation  error  for  w\  (left)  and 
W2  (right).  At  the  beginning  of  the  simulation  the  weights  associated  with  xb\  were  set  to  values 
that  made  the  approximation  about  half  of  the  true  value  of  the  function  throughout  the  entire 
x  domain.  To  provide  a  valid  basis  for  evaluation  of  the  control  system’s  function  approximation, 
the  best  possible  approximation  for  w\  was  found  off-line  using  a  gradient  algorithm  and  random 
samples  from  the  domain  until  the  error  grew  very  small.  Thus,  in  Figure  2  we  see  the  starting  point 
for  the  simulation,  the  best  possible  approximation,  and  the  approximation  that  results  after  125 
seconds  of  simulation.  It  is  clear  that  the  approximation  is  greatly  improved  after  the  simulation 
over  some  of  the  domain.  It  should  not  be  surprising  that  the  fit  is  nearly  identical  to  the  initial 
approximation  in  regions  of  the  domain  that  the  state  did  not  reach.  Because  of  the  local  nature  of 
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the  basis  functions  used,  the  approximation  will  only  train  in  regions  that  are  visited  by  the  state. 


Figure  2:  Function  approximation  of  w\  (on  the  left)  and  W2  (on  the  right). 


In  the  case  of  u>2,  uncertainty  bounds  are  imposed  on  the  weights  as  shown  in  equation  (33).  This 
causes  the  approximation  of  the  network  to  be  constrained  to  lie  within  upper  and  lower  bounding 
limits.  The  uncertainty  bound  given  by  equation  (39)  implies  that  the  value  of  W2  should  remain 
between  4.5  and  5.5  for  all  values  of  x.  However,  to  place  bounds  on  the  approximation  here  we  use 
the  best  approximation  of  the  upper  and  lower  bounds  to  provide  upper  and  lower  bounds  for  0W2. 

Figure  2  (right)  shows  the  results  of  the  W2  approximation.  The  best  approximation  of  the 
upper  and  lower  bounds  are  shown  as  dashed  and  dash-dot  lines  while  the  approximation  after  the 
simulation  is  complete  is  shown  by  the  solid  line.  The  simulation  is  initialized  with  the  0W2  =  0W2 .  For 
this  reason,  approximation  at  the  end  of  the  simulation  is  nearly  identical  to  the  best  approximation 
of  the  lower  bound  in  regions  of  the  domain  not  visited. 

It  is  clear  from  Figure  2  that  the  approximations  improve  greatly  during  the  course  of  the 
simulation.  This  is  largely  the  cause  of  the  improved  tracking  shown  in  Figure  1. 

4  Two-Input  Two-Output  Control  Design 

In  many  systems,  including  control  of  high  performance  aircraft  systems,  it  is  common  to  have 
multiple  control  inputs  affecting  the  state  variables.  One  of  the  approaches  for  handling  such  systems 
is  to  use  decoupling  methods  for  isolating  the  effect  of  a  control  input  on  a  measured  output. 
However,  such  methods  require  that  the  designer  know  a  priori  the  dynamics  of  the  system  such  that 
coupling  between  control  inputs  can  be  cancelled  (or  minimized).  The  problem  becomes  considerably 
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more  difficult  if  the  system  is  nonlinear  and  some  of  these  nonlinearities  are  unknown.  A  key  challenge 
is  to  handle  unknown  nonlinearities  multiplying  the  control  inputs.  In  this  section,  we  design  a 
neural  control  scheme  for  a  two-input  two-output  system  and  analyze  the  stability  properties  of  the 
closed-loop  system. 

We  formulate  the  two-input  two-output  (TITO)  problem  as  an  extension  of  the  SISO  problem 
and  consider  the  problem  of  tracking  control  for  a  TITO  plant  with  full  state  measurement.  The 
state  vector  is  defined  as  X,  where  X  =  [x  z]T,  where  x  and  z  are  the  states,  (x,z)  €  Q  C  M2,  and 
ft  is  a  domain  of  interest.  The  control  vector  is  defined  as  U,  where  U  =  [u  v]T  and  u  and  v  are  the 
control  inputs.  The  plant  dynamics  for  the  system  are  written  as 


X  =  WA  +  WBU 


(40) 


where  WA  is  a  vector  of  functions,  IVg  is  a  matrix  of  functions  and  they  are  given  by 


WA 


Wi(x,z) 

w2(x,z) 


and  Wb  = 


W3(x,z)  W4  (x,z) 
w5(x,z)  w6(x,z) 


(41) 


The  functions,  wt(x,  z)  for  i  =  1, ...  ,6,  are  assumed  to  be  unknown  and  the  system  represented  by 
equation  (40)  is  assumed  to  be  stabilizable.  The  control  system  objective  is  to  use  the  control  inputs 
signals,  u  and  v,  to  make  the  states,  x  and  z,  track  externally  provided  desired  reference  trajectories, 
Xd  and  Zd,  respectively. 

The  unknown  functions  and  their  associated  uncertainties  are  defined  for  the  TITO  system  in  a 
manner  similar  to  what  was  used  for  the  SISO  system.  Each  of  the  unknown  functions  is  modeled 
using  a  linearly  parameterized  combination  of  basis  functions: 


Wi(x,z)  =  0Z£(x,z)  for  i  =  (42) 

where  z)  is  the  m  x  1  vector  of  basis  functions  and  $Wi  are  the  m  x  1  parameter  vectors,  similar 
to  those  defined  in  equation  (2)  for  the  SISO  formulation.  Just  as  the  true  functions,  t u*,  were  used 
to  define  vector  and  matrix  quantities  in  (41),  function  approximations  ihi  are  used  to  define  Wa 
and  Wb‘ 

The  best  parameter  vector  and  best  approximation  for  each  unknown  function  are  defined  ac¬ 
cording  to  equations  (43)  and  (44)  respectively. 

e*m  =  arg  min  {  sup  \wi(x,z)  -  &l^(x,z)\}  (43) 

(x,z)€n 

K  (x,  z)  =  0*T$(x,z)  (44) 

The  reconstruction  errors  are  given  by 


SWi  (x,  z)  =  Wi  (x,  z)  -  w*  (x,  z) 


(45) 
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and  for  w\  and  W2  we  define  the  reconstruction  error  bounds  on  equation  (45)  as  where 


'4)Wi  —  SUP  |  $Wi(xiz)\-  (^) 

(x,z)€Cl 

We  further  define  a  bound,  =  max{^,,^.},  where  is  a  design  parameter. 

In  the  following  neural  control  design,  the  reconstruction  error  is  addressed  using  an  adaptive 
bound  for  w\  and  w2.  However,  the  reconstruction  errors  in  the  approximations  of  the  functions 
which  multiply  the  control  inputs  are  dealt  with  differently.  These  reconstruction  errors,  6i  for 
i  =  3, . . . ,  6,  are  defined  as 


Ws(x,z)u  =  $3Tt;(x)u  +  Ss(x9Z9u) 

(47) 

W4(x,z)v  =  6lT£,(x)v  +  5±{x9z9v) 

(48) 

W5(x,z)u  =  #5T£(x)u -b  <55(2,  z,u) 

(49) 

Wq(x,z)v  =  0qT€(x)v  +  66{x,z9v). 

(50) 

For  these  functions  of  the  WB  matrix,  the  reconstruction  error  bounds  are  not  adaptive  and  are 
defined  as  SWi  -  max  \Sm  |  for  i  =  3, . . . ,  6  and  where  SWi  is  given  by  equation  (45). 

We  further  assume  that  the  functions,  w3...6,  are  within  some  known  bounds.  These  bounds  are 
defined  as  w^x,  z)  and  Wi{x ,  z)  where 

Wi{x,  z)  <  Wi(x,  z)  <  Wi{x ,  z).  (51) 


and  where  Wi(x,z)  =  0^(x,z)  and  w^x.z)  =  6F.£(x9z).  Therefore,  we  impose  the  restriction  on 
the  function  estimates  by  preventing  the  associated  parameter  vectors  from  exceeding  the  upper  and 
lower  parameter  vectors,  0Wi  and  0_w. . 

We  define  the  tracking  errors  for  the  x  output,  ex  =  x  —  x<i,  and  for  the  z  output,  ez  =  z  —  z^. 
We  then  use  these  errors  to  define  sliding  mode  variables  for  x  and  z  and  write  this  as  a  vector  of 
components 


s  — 


sx 

Cj;  4"  J  6xdT 

Sz 

ez  +  C2  f  ezdr 

(52) 


where  Ci  and  c 2  are  positive  design  constants.  Using  these  sliding  mode  quantities  in  the  control 
law  induces  proportional  plus  integral  control  action  [17,  31]. 

The  rules  for  updating  the  parameter  vectors  are  defined  similar  to  what  was  shown  in  Section 
2  and  are  given  by 


0Wl  -  ir{€sx  -  <r(0Wl  -  OnJ}  (53) 

eW2  =  (54) 
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and 


=  7 £«x  U 

(55) 

oWi 

=  7 isx  V 

(56) 

eWb 

=  7 u 

(57) 

Owe 

=  7 Cs2  V 

(58) 

where  a  is  a  leakage  constant  and  B°Wl  and  0°2  are  design  parameters.  The  update  rules  for  the 
adaptive  bounds  are  given  by 

=  7{«x  tanh(y)  -  a(i>Wl  - 

=  7 {s*  tanh(  j)  -  a(ijjW2  -  </j°2)} 

where  t/i®  ,  and  are  also  design  parameters. 

We  now  write  the  vector  of  control  inputs  as  U  where 

-Owi£  ~  tanh(^)  +  ±d  -  c\ex  -  ksx  -  (Sm  +  5W4)sgn(sx) 

-Ow2t  -  Tpw2  tanh(^)  +  zd  -  c2ez  -  ksz  -  (SWs  +  5Wli)sgn(sz) 

where  A;  is  a  positive  real  number. 

We  insure  the  existence  of  the  controls,  U ,  by  making  restrictions  that  guarantee  that  W q1  is 

defined.  We  require  that  the  sign  of  the  determinant  of  Wb  remains  constant  over  the  range  of 

possible  variations  of  the  function  approximation  and  we  also  require  that  each  of  the  elements  of 
WB  are  sign  definite.  We  assume  that  each  of  these  elements  are  positive  and 


U  =  W 


(59) 

(60) 


(61) 


^3^6  _  W4W5  >  0  V  (Z,  z)  €  ft 


(62) 


insuring  that  WB  has  a  positive  determinant. 

In  cases  where  the  sign  of  the  determinant  is  negative  or  the  sign  of  some  or  all  of  the  elements  of 
WB  are  negative,  restrictions  similar  to  (62)  can  be  made  so  long  as  the  elements  are  sign  definite. 

Stability  of  the  system  is  addressed  by  using  Lyapunov  analysis.  For  the  TITO  case,  consider 
the  Lyapunov  function  given  by 


v  =  Ists  +  y  +  yik 

2  k  ^ 


(63) 


where  7  >  0  is  the  adaptive  gain,  6Wi  -  6Wi  -  6*w.  for  i  =  1...6,  and  ij>Wi  =  xp w.  -  .  for  i  =  1,2. 

Taking  the  derivative  of  equation  (63)  and  using  equation  (52),  we  get 


C\€x  %d 
C2ez  ”  Zd 


eLL. 


+  st{wa  +  wBu }  +  £  +  £ 


'ifiiv,  'ipwj 


(64) 


*= 1 


i- 1,2 
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If  we  now  add  and  subtract  sTWA  and  sTWBU  in  equation  (64)  we  obtain 
V  =  sT{| 


-  %d 

+ 

$w\ 

oh 

£  + 

n 

» _ 

1 

1 

c2ez 

-  Zd 

$W2 

.  ^  . 

l 

<o 

1 

elA  ~elA 


6  oLL 


u  +  wA  +  wBu}  +  +  E 


Ipwi  Ip Wi 


(65) 


i—l 


»=  1,2 


Using  the  reconstruction  error  bounds  we  can  write  equation  (65)  as 
V  <  sT{ 


1 

1 _ 

; 

£  + 

&W3 

&W4 

1 

QlA 

i 

1  ^ 
■Trv 

_ I 

c2e*  -  zd 

I 

&W5 

&we 

1 

QwA 

- 1 

uy> 

CO 

U 


+  WA  +  WBU}  +  \s\7 


ft 


^wAui  +  ^ 


1=1 


<=1,2 


'tfiwj  'Ipwj 
7 


(66) 


where  |s|  =  [|sx|  |sz|]T. 

We  now  take  the  parameter  update  rules  shown  in  equations  (53-58)  and  plug  them  into  (66)  to 
obtain 

r  i  r  i  r 

1 

I  I 

^Wq 


c\ex  •fid, 

c2  -  Zd 


V  <  sT{ 

+  l»|T 

The  control  from  equation  (61)  is  substituted  into  (67)  to  obtain 

V  <  klT 


I 

*§ 

1 _ 

1 

1 

go 

L  ft* . 

[a  a  \ 

_  ol2(eW2  -  e°W2)  \ 

+  WA  +  WBU} 

+  E 


i=l,2 


tpWjTpVH 

7 


(67) 


’  rwi ' 

-sT 

{xpWl  tanh(^f)  +  ksx} 

-[a  al 

mft*  . 

{ipW2  tanh(*f- )  +  ksz}  _ 

L a  J 

/el2{eW2-e°m)  _ 

+  E 


<=1,2 
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Using  (20),  (59)  and  (60)  we  can  write  (68)  as 

V  <  ~-ksT s  +  rp^ne  +  ij>w2K€  -  [  °  a  ] 

which  can  also  be  written  as 

2 

V  <  -ksl  -  ks2z  +  i)*Wl K€  +  %p*W2 ne  -<7^2{6Wi (6m  -  0%,.)  +  rpWi (ipWi  -  ) }  • 


)  +  i’m  (<Pm  ~  C, ) 

0lJ0yj2  -0l2)+  (ipW2  -  CJ 


(68) 


(69) 


(70) 


i=l 


As  was  done  in  Section  2,  we  can  show  stability  [23]  where  V  <  —aV  +  A,  a  —  min(2fc,  <77)  and 


A  =  ne(rwi  +  CJ  +  f  J2{(0*Wi  -  e°wf{6*Wi  -  0°Wi)}  +  (^  -  <)2  +  (C2  -  C2)2-  (71) 


i=l 
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In  the  case  that  the  parameter  update  equations  (55-58)  attempt  to  drive  the  parameter  values 
of  {6Wi  |  i  =  3, . . . ,  6}  beyond  their  bounds,  we  zero  the  update  to  the  parameter.  The  terms  left 
remaining  in  the  Lyapunov  derivative  due  to  this  zeroing  are  similar  to  those  shown  in  (34).  Corre¬ 
spondingly,  the  stability  analysis  that  was  done  for  the  SISO  case  shown  in  Section  2  can  be  used  to 
show  zeroing  parameter  updates  does  not  de-stabilize  the  system  in  the  TITO  case  either.  There¬ 
fore,  we  obtain  uniform  ultimate  boundedness  of  the  system  tracking  errors,  the  parameters  and  the 
adaptive  bounds. 


5  TITO  Simulation  Example 


To  illustrate  the  neural  control  system  design,  a  model  has  been  defined  which  is  affine  in  the  controls 
and  has  nonlinear  characteristics. 

The  model  is  given  by 

wn(x)  +  Wi2(z)  W3  w4(x)  u 

W2\(x)  +  W22{z)  W5(x)  We  v 

where 


wn(x)  =  2  +  0.5 x  +  0.2500 x2  -  0.0104  x3  -  0.0078 x4  (73) 

wn(z)  =  z  (74) 

w2i(x)  =  5  +  x-  0.4821  x2  -0.0357x3  +  0.0179a:4  (75) 

W22  (z)  =  —  1.7z  (76) 

w>3  =  we  =  3  (77) 

w4(x)  =  w5(x )  =  x/8+1  (78) 


and  we  define  the  domain  of  the  plant  state  as  ft  =  {(a :,z)  |  x  €  [-5,5],  z  €  [-5,5]}. 

This  model  differs  slightly  from  the  model  given  by  equations  (40)  and  (41).  In  (72)  we  allow 
the  wi (x,  z)  and  w2(x,z)  terms  to  be  written  as  separate  functions  of  x  and  z.  We  also  write  w4 
and  W5  as  functions  of  x  only,  while  v's  and  wq  are  constants.  These  changes  simplify  the  function 
approximations  for  presentation  purposes. 

The  same  function  approximation  structure  is  used  here  for  the  TITO  example  as  was  used  for 
the  SISO  example.  Namely,  a  linear-in-the-parameters  neural  network  of  the  form  given  in  equation 
(2).  For  the  TITO  case,  each  set  of  Radial  Basis  Functions  (RBFs),  £(x)  and  £(z),  consists  of  51 
evenly  spaced  functions  over  a  domain  of  [-5, 5]  with  standard  deviation  of  0.1. 

The  desired  trajectories,  xj  and  Zrf.  are  given  by  a  combination  of  sine  waves  of  various  frequencies 
and  square  waves.  As  in  the  SISO  simulation,  these  trajectories  provide  abundant  excitation  for  the 
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function  approximation.  The  magnitudes  of  Xd  and  Zd  are  also  defined  to  allow  the  state  trajectories 
to  remain  within  ft. 

Simulation  results  have  been  generated  for  the  TITO  plant  defined  above,  Xd  and  Zd,  and  the 
control  structure  given  in  Section  4.  To  illustrate  the  impact  of  learning  on  closed  loop  system 
performance,  the  neural  networks  were  initialized  with  weight  values  that  resulted  in  poor  initial 
function  approximations.  Initial  0Wl  and  0W2  parameter  vectors  were  chosen  such  that  ^  0  and 
w2  «  0  throughout  ft.  The  weights  for  wi  where  i  =  3, . . . ,  6  are  initialized  to  the  worst  case  bounds, 
e.g.,  those  which  result  in  the  minimum  determinant  of  Wb .  Thus,  the  W3  and  w§  approximators 
are  initialized  with  6Ws  and  0WQ  and  and  ti5  are  initialized  with  0WA  and  0W6 .  Successful  system 
performance  was  to  be  demonstrated  by  tracking  performance  improvements  through  time  as  the 
function  approximators  more  accurately  mapped  the  system  dynamics. 

Closed  loop  tracking  results  are  shown  in  the  time  histories  in  Figures  3.  The  traces  on  the  left 
of  Figure  3  show  rather  poor  tracking  for  the  first  five  seconds  of  the  simulation.  This  is  due  to  the 
fact  that,  initially,  the  function  approximation  outputs  do  not  accurately  represent  the  dynamics. 
However,  after  25  seconds  of  simulation,  tracking  is  substantially  improved  as  can  be  seen  on  the 
right  two  plots.  The  improvement  in  the  tracking  of  the  closed-loop  system  is  due  to  the  improved 
function  approximation  that  results  as  the  simulation  runs. 


Figure  3:  On  the  left  the  initial  five  seconds  of  states  x(t)  and  z(t)  tracking  the  desired  trajectories 
Xd  and  Zd ;  on  the  right,  the  tracking  performance  is  shown  after  25  seconds. 

The  improvement  in  function  approximation  can  be  seen  in  Figures  4-5.  The  approximations 
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for  W\  and  w2  are  plotted  along  the  line  x  =  z  in  the  (x,  z )  plane  in  Figure  4.  This  provides 
some  indication  of  fit  over  the  entire  domain  without  the  complexity  of  surface  plots  with  multiple 
surfaces.  Figure  4  shows  the  function  approximation  after  the  simulation  is  complete,  the  initial 
approximation,  and  the  best  possible  approximation  given  the  structure  of  the  approximator.  It 
should  be  noted  that  although  the  approximations  improve  greatly  in  those  regions  of  the  {x,z) 
domain  visited  during  the  simulation,  the  approximation  remains  very  near  the  initial  value  in 
regions  not  visited  during  the  simulation.  This  is  due  to  local  nature  of  the  RBFs. 


Figure  4:  Function  approximation  of  W\(x,z)  (left)  and  w2{x,z)  (right). 


4 

-  best  approx. 

-  final  approx. 

3.5 

- initial  approx. 

3 

25 

x  Domain 


Figure  5:  Function  approximation  w3  and  w6  (left)  and  w4(x)  and  w5(x)  (right). 


Figure  5  shows  the  approximation  of  the  functions  which  multiply  the  control  inputs.  Since 
these  functions  depend  only  on  x,  these  plots  have  x  as  the  horizontal  axis  and  because  w 3  =  wq  and 
w4  —  tr?5,  only  two  plots  are  shown.  These  plots  show  the  best  approximation  to  the  upper  bounds 
and  lower  bounds,  and  the  approximation  after  the  simulation  has  run  for  30  seconds.  The  plots  in 
Figure  5  shows  good  approximation  of  the  true  function  value  in  parts  of  Q  which  are  visited  during 
the  simulation  and  remain  near  the  boundaries  in  other  regions  of  Q. 
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6  Application  to  MIMO  systems 


To  this  point  we  have  only  addressed  SISO  and  TITO  systems.  However,  the  approach  that  has 
been  applied  to  the  TITO  problem  is  applicable  to  MIMO  systems  with  full  state  output  where 
the  number  of  inputs  is  equal  to  the  outputs.  For  such  systems  the  equations  of  motion  written  in 
equation  (40)  can  be  written  in  a  more  general  form  as 

n 

x\  =  wi,i +y^wlyi+1u* 

i=l 

n 

x2  =  w2,i +y^w2,»+itt* 

i= 1 

n 

Xn  =  Wn,\  +'£2wU,i+lui  (79) 

i—  1 

where  wi}j  are  functions  of  the  states  (xi,x2,  ...£n)  and  basis  functions  are  therefore  functions  of  all 
states.  Functions  would  be  approximated  by  witj(x i,z2,  —xn)  where 

Wi,j(x  !,x2,...xn)  =  0jtj£(xi  ,x2,...xn).  (80) 

An  adaptive  bound  would  be  used  for  Wiy i  and  fixed  bounds  would  be  used  for  those  functions  which 
multiply  the  controls.  The  update  rules  would  be  extensions  of  equations  (53-60),  and  the  control, 
U  =  Wg1  L ,  would  be  an  n-dimensional  extension  of  equation  (61),  where  WB  would  become  an 
n-by-n  matrix  given  by 

•••£»)  Wl,3(*l»— ®n)  *•'  Wl,n+l(£l,— ®n) 

wB  =  •  •*.  \  .  (81) 

U)n,2  O^l  ?  ••*2'7i)  ^n,3  (xi  >  **’  0^T» 

and  where  the  matrix,  L ,  is  given  by 

-0Wifi£  -  tanh(^)  +  xu  -  CieXl  -  ksXl  -  (SWl  2  H - h  SWl  „+1)sgn(sXl) 

-0w2,i€  ~  4>w2tl  tanh(^f2-)  4-  x2d  -  cxeX2  -  ksX2  -  (SW22  + - h  5W2}U+1)sgn(sX2) 

jj  _ 

^n, i  tanh(-^“-)  +  %nd  cieXn  ksXn  {^wUl 2  T  *  *  ■  +  ^wn,n+i)sSn(^xn) 

Since  the  control  solution  given  above  includes  WJ1,  we  must  insure  that  the  determinant  of  WB 
maintains  one  sign.  Therefore,  this  matrix  must  be  checked  throughout  the  ranges  of  variations  to 
insure  that  the  sign  of  the  determinant  does  not  change.  This  approach  for  the  MIMO  case  results 
in  uniformly  ultimately  bounded  stability  just  as  was  shown  in  the  SISO  and  TITO  cases. 
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7  Conclusion 


A  neural  control  approach  for  a  class  of  nonlinear  systems  has  been  developed.  Lyapunov  analysis 
has  been  used  to  insure  bounded  tracking  errors  and  robust  adaptive  methods  prevent  parameter 
drift.  Results  have  been  illustrated  with  Single-Input  Single-Output  and  Two-Input  Two-Output 
plants.  Also,  extensions  to  more  general  Multiple-Input  Multiple-Output  plants  has  been  supported 
with  analysis. 
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